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Vh ' Abstract 

, The groups Gk,i of Richard Thompson and Graham Higman can be generalized in a natural 

way to monoids, that we call Mk i, and to inverse monoids, called Invk,i', this is done by simply 
generalizing bijections to partial functions or partial injective functions. The monoids M^i have 
connections with circuit complexity (studied in another paper). Here we prove that Mk,i and InVk^i 
are congruence-simple for all k. Their Green relations J and D are characterized: Mfc ; i and Invk,i 
■ are J-0-simple, and they have k — 1 non-zero D-classes. They are submonoids of the multiplicative 

part of the Cuntz algebra Ok- They are finitely generated, and their word problem over any finite 
t ■ generating set is in P. Their word problem is coNP-complete over certain infinite generating sets. 
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1 Thompson-Higman monoids 

Since their introduction by Richard J. Thompson in the mid 1960s |26 [ 123 1 [27]. t ne Thompson groups 
■ have had a great impact on infinite group theory. Graham Higman generalized the Thompson groups 

On : to an infinite family |17j . These groups and some of their subgroups have appeared in many contexts 

and have been widely studied; see for example [91 [5l fT2l [71 fT^l [T5l l6l [HI [20] . 

The definition of the Thompson-Higman groups lends itself easily to generalizations to inverse 
monoids and to more general monoids. These monoids are also generalizations of the finite symmetric 
monoids (of all functions on a set), and this leads to connections with circuit complexity; more details 
on this appear in [TJ EJ 0] . 

By definition the Thompson-Higman group Gk,i consists of all maximally extended isomorphisms 
between finitely generated essential right ideals of A*, where A is an alphabet of cardinality k. The 
multiplication is defined to be composition followed by maximal extension: for any ip,i/j G i, we 
have tp ■ ip = max{(f o ip). Every element ip 6 Gk,\ can also be given by a bijection <p : P — > Q where 
P, Q C A* are two finite maximal prefix codes over A; this bijection can be described concretely by a 
finite function table. For a detailed definition according to this approach, see [3] (which is also similar 
to [25], but with a different terminology); moreover, Subsection 1.1 gives all the needed definitions. 

It is natural to generalize the maximally extended isomorphisms between finitely generated essential 
right ideals of A* to homomorphisms, and to drop the requirement that the right ideals be essential. 
It will turn out that this generalization leads to interesting monoids, or inverse monoids, which we call 
Thompson-Higman monoids. Our generalization of the Thompson-Higman groups to monoids will also 
generalize the embedding of these groups into the Cuntz algebras [3l [23], which provides an additional 
motivation for our definition. Moreover, since these homomorphisms are close to being arbitrary 
finite string transformations, there is a connection between these monoids and combinational boolean 
circuits; the study of the connection between Thompson-Higman groups and circuits was started in 
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[U [2] and will be developed more generally for monoids in [I]; the present paper lays some of the 
foundations for pQ. 

1.1 Definition of the Thompson-Higman groups and monoids 

Before defining the Thompson-Higman monoids we need some basic definitions, that are similar to 
the introductory material that is needed for defining the Thompson-Higman groups i; we follow 
[3] (which is similar to We use an alphabet A of cardinality \A\ = k, and we list its elements as 

A = {ai, . . . , dfc}. Let A* denote the set of all finite words over A (i.e., all finite sequences of elements 
of A); this includes the empty word e. The length of w G A* is denoted by \w\; let A n denote the set 
of words of length n. For two words u, v G A* we denote their concatenation by uv or by u ■ v; for sets 
B,C Q A* the concatenation is BC = {uv : u G B,v G C}. A right ideal of ^4* is a subset it! C A* 
such that C R. A generating set of a right ideal R is a set C such that R is the intersection of 
all right ideals that contain C; equivalently, R = CA* . A right ideal R is called essential iff -R has a 
non-empty intersections with every right ideal of A* . For words u, v G A*, we say that u is a prefix of 
v iff there exists z £ A* such that uz = v. A prefix code is a subset CC A* such that no element of 
C is a prefix of another element of C. A prefix code is maximal iff it is not a strict subset of another 
prefix code. One can prove that a right ideal R has a unique minimal (under inclusion) generating set, 
and that this minimal generating set is a prefix code; this prefix code is maximal iff R is an essential 
right ideal. 

For right ideals R' C R C A* we say that R' is essential in R iff R' intersects all right subideals of 
R in a non-empty way. 

Tree interpretation: The free monoid A* can be pictured by its right Cayley graph, which is the 
rooted infinite regular fc-ary tree with vertex set A* and edge set {(v, va) : v G A*, a G vl}. We simply 
call this the iree of A* . It is a directed tree, with all paths moving away from the root e (the empty 
word); by "path" we will always mean a directed path. A word v is a prefix of a word w iff v is is an 
ancestor of w in the tree. A set P is a prefix code iff no two elements of P are on the same path. A set 
R is a right ideal iff any path that starts in R has all its vertices in R. The prefix code that generates 
R consists of the elements of R that are maximal (within R) in the prefix order, i.e., closest to the root 
e. A finitely generated right ideal R is essential iff every infinite path of the tree eventually reaches 
R (and then stays in it from there on). Similarly, a finite prefix code P is maximal iff any infinite 
path starting at the root eventually intersects P. For two finitely generated right ideals R' C R, R' is 
essential in R iff any infinite path starting in R eventually reaches R' (and then stays in R' from there 
on). In other words for finitely generated right ideals R' Q R, R' is essential in R iff R' and R have 
the same "ends". For the prefix tree of A* we can consider also the "boundary" A^ (i.e., all infinite 
words), a.k.a. the ends of the tree. In Thompson's original definition [261 127j . G^i was given by a 
total action on {0, 1} W . In [3] this total action was extended to a partial action on A* \JA U ; the partial 
action on A* U A u is uniquely determined by the total action on A^ ; it is also uniquely determined by 
the partial action on A*. Here, as in [3], we only use the partial action on A*. 

Definition 1.1 A right ideal homomorphism of A* is a total function (p : Ri — > A* such that Ri is 
a right ideal of A* , and for all x\ G R\ and all w G A*: ip(x\w) = (f(xi) w. 

For any partial function / : A* — > A*, let Dom(/) denote the domain and let Im(/) denote the image 
(range) of /. For a right ideal homomorphism if : Ri ^ A* it is easy to see that the image Im(^) 
is also right ideal of A*, which is finitely generated (as a right ideal) if the domain R\ = Dom(</?) is 
finitely generated. 
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A right ideal homomorphism (p : R± — > P2, where R\ = Dom(^) and R2 = Im(<^), can be described 
by a total surjective function P\ — ► S2, with Pi,S*2 C A*; here P\ is the prefix code (not necessarily 
maximal) that generates R\ as a right ideal, and S* 2 is a set (not necessarily a prefix code) that 
generates R2 as a right ideal; so Pi = Pi A* and P2 = S2A*. The function P\ — ► S2 corresponding to 
</? : i?i — > i?2 is called the ia&/e of <p. The prefix code Pi is called the domain code of <p and we write 
Pi = domC(</?). When S2 is a prefix code we call S2 the image code of ip and we write £2 = imC(<p). 

Definition 1.2 An infective right ideal homomorphism is called a right ideal isomorphism. A right 
ideal homomorphism <p : R\ — > R2 is called total ijff the domain right ideal Ri is essential. And ip is 
called surjective iff the image right ideal R2 is essential. 

The table Pi — > P2 of a right ideal isomorphism ip is a bijection between prefix codes (that are not 
necessarily maximal). The table Pi — > S2 of a fotaZ right ideal homomorphism is a function from 
a maximal prefix code to a set, and the table Pi — > S2 of a surjective right ideal homomorphism is 
a function from a prefix code to a set that generates an essential right ideal. The word "total" is 
justified by the fact that if a homomorphism ip is total (and if domC(^) is finite) then <p(w) is defined 
for every word that is long enough (e.g., when \w\ is longer than the longest word in the domain code 
Pi ) ; equivalently, <p is defined from some point onward on every infinite path in the tree of A* starting 
at the root. 

Definition 1.3 An essential restriction of a right ideal homomorphism ip : Pi — > A* is a right ideal 
homomorphism <E> : R[ — > A* such that R[ is essential in R\, and such that for all x' x G R[: ip(x'i) = 

We say that ip is an essential extension of <& iff $ is an essential restriction of ip. 

Note that if <3? is an essential restriction of ip then R' 2 = Im($) will automatically be essential in 
P2 = Im(ip). Indeed, if I is any non-empty right subideal of Pi then InR[ / 0, hence ^ ^(ICiR^) 
C $(/) fl ^(P'i) = H P 2 ; moreover, any right subideal J of P2 is of the form J = $(/) where 
/ = <I> _1 (J) is a right subideal of Pi; hence, for any right subideal J of P2, / Jn R' 2 . 

Proposition 1.4 (1) Letip,& be homomorphisms between finitely generated right ideals of A* , where 
A = {ai, . . . ,0^}. Then is an essential restriction of ip iff & can be obtained from ip by starting 
from the table of ip and applying a finite number of restriction steps of the following form: Replace 
(x,y) in a table by {(xai,yai), ... , (xa k ,ya k )}. 

(2) Every homomorphism between finitely generated right ideals of A* has a unique maximal essential 
extension. 

Proof. (1) Consider a homomorphism between finitely generated right ideals ip : Pi — ► P2, let Pi be 
the finite prefix code that generates the right ideal Pi, and let S*2 = ¥>(Pi), so S*2 generates the right 
ideal P2. 

If x 6 Pi and y = (p(x) € S2 then (since <p is a right ideal homomorphism), yai = ip(xai) for 
i = 1, . . . , k. Then Pi — {x} is a right ideal which is essential in Pi, and Pi — {x} is generated 
by (Pi — {x}) U {xai, . . . ,xak}. Indeed, in the tree of A* every downward directed path starting 
at vertex x goes through one of the vertices xai. Thus, removing (x,y) from the graph of <p is an 
essential restriction; for the table of <p, the effect is to replace the entry (x, y) by the set of entires 
{(xai,yai), . . . , (xak,yak)}- If finitely many restriction steps of the above type are carried out, the 
result is again an essential restriction of <p. 

Conversely, let us show that if is an essential restriction of ip then $ can be obtained by a finite 
number of replacement steps of the form "replace (x,y) by {(xai,yai), . . . , (xa,k,yak)} in the table". 
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Using the tree of A* we have: If R and R' are right ideals of A* generated by the finite prefix codes 
P, respectively P' , and if R' is essential in R then every infinite path from P intersects P'. It follows 
from this characterization of essentiality and from the finiteness of Pi and P[ that R\ — R\ is finite. 
Hence (f and $ differ only in finitely many places, i.e., one can transform ip into $ in a finite number 
of restriction steps. 

So, the restriction <3? of ip is obtained by removing a finite number of pairs (x, y) from ip; however, 
not every such removal leads to a right ideal homomorphism or an essential restriction of ip. If (xq, yo) 
is removed from ip then xq is removed from i?i (since ip is a function). Also, since P' x is a right ideal, 
when xq is removed then all prefixes of xq (equivalently, all ancestor vertices of xq in the tree of A*) 
have to be removed. So we have the following removal rule (still assuming that domain and image 
right ideals are finitely generated): 

If & is an essential restriction of (p then ip can be transformed into $ by removing a finite set of 
strings from R\ , with the following restriction: If a string xq is removed then all prefixes of xq are 
also removed from R\; moreover, x$ is removed from R\ iff (xq, ip(xo)) is removed from (p. 

As a converse of this rule, we claim that if the transformation from (p to $ is done according to 
this rule, then $ is an essential restriction of (p. Indeed, $ will be a right ideal homomorphism: if 
$(xi) is defined then &(x\z) will also be defined (if it were not, the prefix x\ of x\z would have been 
removed), and <&(x\z) = ip(x±z) = (p(x\) z = $(xi) z. Moreover, Dom($) = R^ will be essential in 
R\: every directed path starting at R\ eventually meets R[ because only finitely many words were 
removed from Pi to form R[. Hence by the tree characterization of essentiality, R[ is essential in R\. 

In summary, if is an essential restriction of ip then $ is obtained from ip by a finite sequence of 
steps, each of which removes one pair (x, >p(x)). In Dom(^) the string x is removed. The domain code 
becomes (Pi — {x}) U {xa\, . . . , xaj,}, since {xa\, . . . , xak} is the set of children of x in the tree of A* . 
This means that in the table of ip, the pair (x, ip(x)) is replaced by {(xa\, ip(x) a\), . . . , (xak, p(%) afc)}- 

(2) Uniqueness of the maximal essential extension: By (1) above, essential extensions are obtained 
by the set of rewrite rules of the form {(xa\,ya\), . . . , (xak, yak)} — ► (x,y), applied to tables. This 
rewriting system is locally confluent (because different rules have non-overlapping left sides) and ter- 
minating (because they decrease the length); hence maximal essential extensions exist and are unique. 
□ 

Pr op osition 1 1 . 41 yields another tree interpretation of essential restriction: Assume first that a total order 
a i < 0,2 < ■ ■ ■ < Ofc has been chosen for the alphabet A; this means that the tree of A* is now an oriented 
rooted tree, i.e., the children of each vertex v have a total order (namely, va\ < va2 < . . . < vak)- The 
rule "replace (x,y) in the table by {(xa±,yai), . . . , (xak, yak)}" has the following tree interpretation: 
Replace x and y = ip(x) by the children of x, respectively of y, matched according to the order of the 
children. 

Important remark: 

As we saw, every right ideal homomorphism can be described by a table P — > S where P is a prefix 
code and S is a set. But we also have: Every right ideal homomorphism ip has an essential restriction 
ip' whose table P' — > Q' is such that both P' and Q' are prefix codes; moreover, Q' can be chosen to 
be a subset of A n for some n < max{|s| : s G S}. Example (with alphabet A = {a, b}): 



has an essential restriction 

aa. 



s\ : s 


eS} 


( aa 


ah 


\aa 


ab 



an 



Definition 1.5 The Thompson-Higman partial function monoid M^i consists of all maximal essen- 
tial extensions of homomorphisms between finitely generated right ideals of A* . The multiplication is 
composition followed by maximal essential extension. 
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In order to prove associativity of the multiplication of i we define the following and we prove a few 
Lemmas. 

Definition 1.6 By Rlk we denote the monoid of all right ideal homomorphisms between finitely 
generated right ideals of A* , with function composition as multiplication. We consider the equivalence 
relation = defined for <pi,(f2 G Rlk °U : fi = V2 iff max(<^i) = m&x(ip 2 ). 

It is easy to prove that Rlk is closed under composition. Moreover, by existence and uniqueness of 
the maximal essential extension (Prop. [1731 2)) each =-equivalence class contains exactly one element 
of Mk t i- We want to prove: 

Proposition 1.7 The equivalence relation = is a monoid congruence on Rlk, and Mf~\ is isomorphic 
(as a monoid) to RIk/=. Hence, Mk t i is associative. 

First some Lemmas. 

Lemma 1.8 If R\ R% (i = 1,2) are finitely generated right ideals with R[ essential in Ri, then 
R' x fl R' 2 is essential in Ri n R 2 ■ 

Proof. We use the tree characterization of essentiality. Any infinite path p in Ri n R2 is also in i?j 
(i = 1,2), hence p eventually enters into R\. Thus p eventually meets R[ and R' 2 , i.e., p meets R[ HR' 2 . 
□ 

Lemma 1.9 All <p\,<f>2 G Rlk have restrictions $i,$2 G Rlk ( n °t necessarily essential restrictions) 
such that: 

• $2 $1 = f2 fi, an d 

• Dom($ 2 ) = Im(<J>i) = Dom(</3 2 ) H Im(</?i). 

Proof. Let R = T)om(<p 2 ) n Im(c^i). This is a right ideal which is finitely generated since Dom(tp 2 ) 
and Im(i^i) are finitely generated (see Lemma 3.3 of [3]). Now we restrict ip\ to $1 in such a way that 
Im(<I>i) = R and Dom(<]?i) = ip^ (R), and we restrict p 2 to <& 2 in such a way that Dom($2) = R and 
Im(<3?2) = <P2(R)- Then$ 2 o$ 1 (.) and ip 2 o 921 (.) agree on (/?J~ 1 (i?); moreover, Dom($2°$i) = fx (R)- 
Since ip 2 o ipi(x) is only defined when pi(x) G R, we have $2 ^1 = f>2 ¥>i- Also, by the definition 
of R we have Dom(<l>2) = Im($i). □ 

Lemma 1.10 For all <pi,<p2 G Rlk we have: 

max(<^2 Pi) = max(max((^2) Pi) = max(<^2 max(<£>i)). 

Proof. We only prove the first equality; the proof of the second one is similar. By Lemma 11.91 we 
can restrict p\ and ip 2 to p[, respectively p>' 2 , so that <p' 2 o tp^ = p 2 o (pi, and Dom(^' 2 ) = Im^) = 
Dom(<y92) n Im((/?i); let R' = Dom(<^2) n Im(<^i). 

Similarly we can restrict pi and va.ax.{p 2 ) to 99", respectively (p 2 , so that <^ 2 ' f'{ = max((^2) *Pi, 
and Dom((/? 2 ') = Im(c^^') = T)om(m&x((p 2 )) D Im(^i); let R" = Dom(max(</92)) H Im(^i). 

Obviously, R' C i?" (since y?2 is a restriction of max(</32))- Moreover, i?' is essential in R" , by 
Lemma ll.81 indeed, T>om(<p 2 ) is essential in Dom(max(9?2)) since max(9?2) is an essential extension of 
ip 2 . Since R' is essential in R" , <p 2 o<pi is an essential restriction of max^) °*pi- Hence by uniqueness 
of the maximal essential extension, m&x(m&x((p 2 ) Pi) = max(p 2 o max(<^i)). □ 

Proof of Prop. II. 71 If <p 2 = if) 2 then, by definition, max(9?2) = m&x(i{j 2 ), hence by Lemma ll. 101 
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m&x((f2 o (p) = max(max(<^2) V?) = max(max(^2) <p) = max(^2 9?), 
for all 99 G -R/fc- Thus (by the definition of =), <p>2 = 1P2 so = is a right congruence. Similarly 
one proves that = is a left congruence. Thus, RIk/ = is a monoid. 

Since every =-equivalence class contains exactly one element of M^i there is a one-to-one cor- 
respondence between RIk/ = and M^i- Moreover, the map cp G RIk 1 — > max(<y9) G 1 is a 
homomorphism, by Lemma 11.101 and by the definition of multiplication in \. Hence RIk/ = is 
isomorphic to Mk s i- □ 

1.2 Other Thompson-Higman monoids 

We now introduce a few more families of Thompson-Higman monoids, whose definition comes about 
naturally in analogy with M/-^. 

Definition 1.11 The Thompson-Higman total function monoid totM^i and the Thompson-Higman 
surj ective function monoid surMk^i consist of maximal essential extensions of homomorphisms between 
finitely generated right ideals of A* where the domain, respectively, the image ideal, is (inessential right 
ideal. 

The Thompson-Higman inverse monoid Invk,i consists of all maximal essential extensions of iso- 
morphisms between finitely generated (not necessarily essential) right ideals of A* . 

Every element ip G totM^i can be described by a function P — > Q, called the table of ip, where 
P, Q C A* with P a finite maximal prefix code over A. A similar description applies to surMk t i but 
now with Q a finite maximal prefix code. Every ip G Invk,i can be described by a bijection P — > Q 
where P, Q C A* are two finite prefix codes (not necessarily maximal) . 

It is easy to prove that essential extension and restriction of right ideal homomorphisms, as well 
as composition of such homomorphisms, preserve injectiveness, totality, and surjectiveness. Thus 
totMk,i, surMk,i, and Invk,i are submonoids of Mfc^. 

We also consider the intersection totMk^i D surM^i, i.e., the monoid of all maximal essential 
extensions of homomorphisms between finitely generated essential right ideals of ^4*; we denote this 
monoid by totsurMf-i- The monoids M^i, totMk^i, surMk t i, and totsurMk^i are regular monoids. (A 
monoid M is regular iff for every m G M there exists x G M such that mxm = m.) The monoid 
Invk,i is an inverse monoid. (A monoid M is inverse iff for every m G M there exists one and only 
one x G M such that mxm = m and x = xmx.) 

We consider the submonoids totlnvk,! and surlnvk^i of Invk,i, described by bijections P — > Q 
where P,Q C A* are two finite prefix codes with P, respectively Q maximal. The (unique) inverses 
of elements in totInvk,i are in surlnvk,i, and vice versa, so these submonoids of Invk,i are not regular 
monoids. We have totInvk : i n surlnvk : i = G^i (the Thompson-Higman group). 

It is easy to see that for all n > 0, M/% 5 i contains the symmetric monoids PFk" of all partial 
functions on k n elements, represented by all elements of Mf. 1 with a table P — > Q where P,Q^L A n . 
Hence M^i contains all finite monoids. Similarly, totM^i contains the symmetric monoids of all 
total functions on k n elements. And Inv^i contains Xj.n (the finite symmetric inverse monoid of all 
injective partial functions on A n ). 

1.3 Cuntz algebras and Thompson-Higman monoids 

All the monoids, inverse monoids, and groups, defined above, are submonoids of the multiplicative 
part of the Cuntz algebra Ok- 

The Cuntz algebra Ok, introduced by Dixmier [13] (for k = 2) and Cuntz [11 j . is a fc-generated 
star-algebra (over the field of complex numbers) with identity element 1 and zero 0, given by the 
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following finite presentation. The generating set is A = {a±, . . . , a^}. Since this is defined as a star- 
algebra, we automatically have the star- inverses {ai, . . . ,5fc}; for clarity we use over lines rather than 
stars. 

Relations of the presentation: 
OjOj = 1, for i = 1, . . . , k; 
aidj = 0, when i ^ j, 1 <i,j < k; 
aiai + . . . + a^au = 1. 

It is easy to verify that this defines a star-algebra. The Cuntz algebras are actually C*-algebras with 
many remarkable properties (proved in [llj), but here we only need them as star-algebras, without 
their norm and Cauchy completion. 

In [3] and independently in [23] it was proved that the Thompson-Higman group l is the 
subgroup of Ok consisting of the elements that have an expression of the form YlxeP f( x ) ^ w h ere we 
require the following: P and Q range over all finite maximal prefix codes over the alphabet {a\, . . . , a^}, 
and / is any bijection P —* Q. Another proof is given in [19|. More generally we also have: 

Theorem 1.12 The Thompson-Higman monoid Mk t i is a submonoid of the multiplicative part of 
the Cuntz algebra Ok- 

Proof outline. The Thompson-Higman partial function monoid M^i is the set of all elements of Ok 
that have an expression of the form YlxeP f( x ) x where P <Z A* ranges over all finite prefix codes, 
and / ranges over functions P — > A* . 

The details of the proof are very similar to the proofs in [3l[23]; the definition of essential restriction 
(and extension) and Proposition 11.41 insure that the same proof goes through. □ 

The embeddability into the Cuntz algebra is a further justification of the definitional choices that 
we made for the Thompson-Higman monoid i- 

2 Structure and simplicity of the Thompson-Higman monoids 

We give some structural properties of the Thompson-Higman monoids; in particular, we show that 
Mfc i and InVki are simple for all A:. 

2.1 Group of units, J-relation, simplicity 

By definition, the group of units of a monoid M is the set of invertible elements (i.e., the elements 
u G M for which there exists x € M such that xu = ux = 1 , where 1 is the identity element of M) . 

Proposition 2.1 The Thompson-Higman group Gk,i is the group of units of the monoids M^i, 
totMki, surMki, totsurMk i, and Invk,\- 

Proof. It is obvious that the groups of units of the above monoids contain G^i. Conversely, we want 
to show that that if cp G M^x (and in particular, if (p is in one of the other monoids) and if (p has a 
left inverse and a right inverse, then (p 6 Gk^i- 

First, it follows that <p is injective, i.e., <p E Invk,\- Indeed, existence of a left inverse implies that 
for some a £ M/~ i we have a p = 1; hence, if <p(xi) = p{x2) then x\ = a <p{x\) = a <p{x2) = x^- 

Next, we show that domC(<y9) is a maximal prefix code, hence tp G totInVk,i- Indeed, we can again 
consider a £ M^i such that a ip = 1. For any essential restriction of 1 the domain code is a maximal 
prefix code, hence domC(a o p) is maximal (where o denotes functional composition). Moreover, 
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domC(a o ip) is also contained in the domain code of some restriction of ip, since <p(x) must be denned 
when a o tp(x) is defined. Hence domC(i^'), for some restriction <// of 92, is a maximal prefix code; it 
follows that domC((£>) is a maximal prefix code. 

If we apply the reasoning of the previous paragraph to tp" 1 (which exists since we saw that (p is 
injective), we conclude that domC^ -1 ) = imC(</?) is a maximal prefix code. Thus, ip E surlnVki- 

We proved that if ip has a left inverse and a right inverse then ip E totlnvf-i n surlnVki- Since 
totlnVki H surlnvki = Gp-i we conclude that <p € Gjt i- Q 

We now characterize some of the Green relations of M^i and of Invk i, and we prove simplicity. 

By definition, two elements x,y of a monoid M are J -related (denoted x =j y) iff x and y belong 
to exactly the same ideals of M. More generally, the J-preorder of M is defined as follows: x <j y 
iff x belongs to every ideal that y belongs to. It is easy to see that x =j y iff x <j y and y <j x; 
moreover, x <j y iff there exist a, (3 E M such that x = ay (3. A monoid M is called J -simple iff M 
has only one J-class (or equivalently, M has only one ideal, namely M itself). A monoid M is called 
Q-J -simple iff M has exactly two J-classes, one of which consist of just a zero element (equivalently, 
M has only two ideals, one of which is a zero element, and the other is M itself). See |XQ(, 116] for more 
information on the J-relation. Cuntz [IT] proved that the multiplicative part of the C*-algebra Ok is 
a O-J-simple monoid, and that as an algebra Ok is simple. We will now prove similar results for the 
Thompson-Higman monoids. 

Proposition 2.2 The inverse monoid Invk,i and the monoid M^i are O-J-simple. The monoid 
totMj-i is J -simple. 

Proof. Let <p E M^i (or E Invki)- When ip is not the empty map there are xo,yo E A* such that 
yo = <p(xo). Let us define a,f3 G Invk,i by the tables a = {(e 1— » xq)} and (3 = {(yo 1— > e)}. Recall that 
e denotes the empty word. Then (3<pa{.) = {(e 1— » e)} = 1. So, every non-zero element of 1 (and 
of Invk,i) is in the same J-class as the identity element. 

In the case of totM^i we can take a = {(e 1— > xo)} as before (since the domain code of a is {e}, 
which is a maximal prefix code), and we take j3' : Q 1— > {e} (i.e., the map that sends every element of Q 
to e), where Q is any finite maximal prefix code containing yo- Then again, (p a(.) = {(e 1— > e)} = 1. 
□ 

Thompson proved that V (= G^i) is a simple group; Higman proved more generally that when k is 
even then G^i is simple, and when k is odd then G^i contains a simple normal subgroup of index 
2. We will show next that in the monoid case we have simplicity for all k (not only when k is even). 
For a monoid M, "simple", or more precisely, "congruence-simple" is defined to mean that the only 
congruences on M are the trivial congruences (i.e., the equality relation, and the congruence that 
lumps all elements of M into one congruence class). 

Theorem 2.3 The Thompson-Higman monoids InVki and Mk\ are congruence- simple for all k. 

Proof. Let = be any congruence that is not the equality relation. We will show that then the whole 
monoid is congruent to the empty map 0. We make use of 0- J-simplicity, proved in Proposition 12.21 
If we have $ = for some element <I> ^ of Invk t i or M/u 1 then for all a, (3 G Invki or E Mk \ we 
have obviously a§ (3 = 0. moreover, by 0- J-simplicity of Mk\ we have Mk\ = {a <3? (3 : a, (3 € 1} 
for any $ £ \ with $ 7^ 0. Hence, all elements of M^i are = 0. Similarly, by 0- J-simplicity of 
Invk,i we have Invkti = {a$/3 : a, (3 E Inv^i} for any $ E Invk,i 3> 7^ 0. Hence, all elements of 
Invk : i are = 0. 
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If we have <p = ip and (p ^ ip, for any elements (p,ip of Invk,i — {0} or of Mux — {0}, then there 
exist xo,yo,yi £ A* such that v?(xo) = Do ^ Hi = ip(xo). Consider a,/3 £ Invk,i C Mj^i, defined by 
the tables a = {(yo l— ► yo)}i an d /9 = {( x o l— * x o)}- Then a<p/3(.) = {(xq h- > yo)}, an d (xiftPi-) = 0. 
So, a<p (3 = aip/3, aipfl ^ 0, but aift/3 = 0. Hence the previous paragraph, applied to $ = a <^/3, 
implies that the entire monoid is = 0. □ 

2.2 D-relation 

Besides the J-relation and the J-preorder, based on ideals, there are the R- and L— relations and 
R- and L— preorders, based on right (or left) ideals. Two elements x,y £ M are R-related (denoted 
x =r y) iff x and y belong to exactly the same right ideals of M. The R-preorder is defined as follows: 
x <r y iff x belongs to every right ideal that y belongs to. It is easy to see that x =r iff x <r y and 
U <r x; also, x <r y iff there exists a £ M such that x = ya. In a similar way one defines =l and 
<Xj. Finally, there is the D -relation of M, which is defined as follows: x =d V iff there exists s € M 
such that x =r s =lD\ this is easily seen to be equivalent to saying that there exists t £ M such that 
x =l t =r y. For more information on these definitions see for example |101 116j . 

The D-relation of M}- \ and Invki has an interesting characterization, as we shall prove next. We 
will represent all elements of Mfe x by tables of the from ip : P — > Q, where both P and Q are finite 
prefix codes over A (with \A\ = k). For such a table we also write P = domC(^) (the domain code 
of (p) and Q = imC((/?) (the image code of <p). In general, tables of elements of M^i have the form 
P — > S, where P is a finite prefix code and S is a finite set; but by using essential restrictions, if 
necessary, every element of Mjt i can be given a table P — > Q, where both P and Q are finite prefix 
codes. 

Note the following invariants with respect to essential restrictions: 

Proposition 2.4 Let <pi : P\ — > Q\ be a table for an element of M]~\, where P\,Q\ C A* are finite 
prefix codes. Let ip2 '■ Pi — > Q2 be another finite table for the same element of M^i, obtained from the 
table (pi by an essential restriction. Then Pi^Qi C A* are finite prefix codes and we have 

\Pi \ = IP2I mod (k — 1) and 

|Qi| = IQ2I mod (fc-1). 
These modular congruences also hold for essential extensions, provided that we only extend to tables 
in which the image is a prefix code. 

Proof. An essential restriction consists of a finite sequence of essential restriction steps; an essential 
restriction step consists of replacing a table entry (x, y) of <px by {{xa\, ya\), . . . , (xaj~, ya^)} (according 
to Proposition [L4]). For a finite prefix code Q C A* , and q £ Q, the finite set (Q — {q})U{qai, . . . , gafc} 
is also a prefix code, as is easy to prove. In this process, the cardinalities change as follows: |Pi| 
becomes \Pi\ — 1 + k and \Qi\ becomes \Qi\ — 1+k. Indeed (looking at Q\ for example), first an element 
y is removed from Qi, then the k elements {ya\, . . . , yak} are added. The elements yoj that are added 
are all different from the elements that are already present in Q\ — {y}; in fact, more strongly, yai and 
the elements of Q\ — {y} are not prefixes of each other. □ 

As a consequence of Prop. 12.41 it makes sense, for any tp £ Mjt i, to talk about |domC(</?)| and 
|imC(</?) I as elements of Zfc_i, independently of the representation of (p by a right-ideal homomorphism. 

Theorem 2.5 For any non-zero elements (p,ip of Mk t i (or of Invk,i) the D-relation is characterized 
as follows: 

ip =d ip iff |imC(<£>)| = |imC(V ; )| mod (fc — 1). 
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Hence, Mk t \ and Invki have k — 1 non-zero D-classes. In particular, M21 and Inv 2,1 are O-D-simple 
(also called O-bisimpleJ. 

The proof of Theorem 12.51 uses several Lemmas. 

Lemma 2.6 ([1] Lemma 6.1; Arxiv version of [I] Lemma 9.9). For every finite alphabet A and every 
integer i > there exists a maximal prefix code of cardinality 1 + — 1) i. And every finite maximal 
prefix code over A has cardinality 1 + (\A\ — 1) i, for some integer i > 0. 

It follows that when \A\ = 2, there are finite prefix codes over A of every finite cardinality. □ 

As a consequence of this Lemma we have for all ip G Gk,i- IMI = 1 mod (k — 1). Thus, except for the 
Thompson group V (when k = 2), there is a constraint on the table size of the elements of the group. 

In the following id.Q denotes the element of Inv^i given by the table {(x 1— > x) : x G Q} where 
Q C A* is any finite prefix code. 

Lemma 2.7 (1) For any tp G M k 1 (or G Inv^ i) with table P — > Q (where P,Q are finite prefix 
codes) we have: <p =r idg. 

(2) If S,T are finite prefix codes with \S\ = \T\ then ids =D id-r- 

(3) If pi : Pi — ► Qi and ip2 : P2 — > Q2 are such that \Q\\ = \Q 2 \ then ipi =d ¥>2- 

Proof. (1) Let P' C P be a set of representatives modulo <p (i.e., we form P' by choosing one element 
in every set (p~ 1 ip(x) as x ranges over P). So, \P'\ = \Q\. Let a G Invp.,1 be given by a table Q — > P'; 
the exact map does not matter, as long as a is bijective. Then 99 o a(.) is a permutation of Q, and 

(po a =r p o a o [<p o a) -1 = idg. 

Now, 99 <^9 o a >r ip o ao (p o a)~ l o ip = idq o tp = tp, hence tp =r p o a (=# idg). 

(2) Let a : S — > T be a bijection (which exists since \S\ = |T|); so a represents an element of Inv/-^. 
Then a = a o ids(.) and idg = a -1 o a(.); hence, a =l idg. 

Also, a = idx ° a(.) and id-r = a o a _1 (.); hence, a idy. Thus, ids =L ct =R idr- 

(3) If |Qi| = |Q 2 | then id Ql = D id Q2 by (2). Moreover, p 1 = D id Ql and <p 2 =d idg 2 by (1). The 
result follows by transitivity of =r>- 1=1 

Lemma 2.8 (1) For any m > k let i be the residue of m modulo k — 1 in the range 2 < i < k, 
and let us write m = i + (k — for some j > 0. Then there exists a prefix code Qij of cardinality 
\Qi,j\ = m , such that idQ i; . is an essential restriction o/id{ aij Hence, id^ . = idr ai) ... )ai } as 

elements oflnvki- 

(2) In M fej i and in Inv}-,i we have id{ ai} = D id{ 0lj ... )<lfc } = 1. 

Proof. (1) For any m > k there exist i,j>0 such that 1 < i < k and m = i + (k — We consider 
the prefix code 

Qij = {a 2 ,...,ai} U Ur=l a l( A -{«i}) u 4 A - 
It is easy to see that Qij is a prefix code, which is maximal iff i = k; see Fig. 1 below. Clearly, 
\Qi.j\ = i + (k — l)j. Since contains a\A, we can perform an essential extension of idQ^ by 
replacing the table entries {{a\a\, a\ai), (a\a2, a\a2), ■ ■ ■ , (a^ak, a\ak)} by (a\,a\). This replaces Qij 
by Qij-\. So, idg i:) . can be essentially extended to idQ ij _ 1 . By repeating this we find that idg^ is 
the same element (in M^i and in Inv^^i) as idQ. = id/ 0l) 

(2) By essential restriction, id{ ai} = id{ aiaijaia2v .. jaiafe }, in M k) i and in Inv k) i. And by LemmaE2l2), 
id{ aiai)aia2) ... )aiafe } =d id{ aii ... iafc} ; the latter, by essential extension, is 1. □ 
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Fig. 1: The prefix tree of Qij. 



/ 



J- 1 



^ ^{a 2 ,...,ai} 
ai {A-{ai}) 



a\ {A-{a x }) 



A 



a{- l {A-{a x }) 



Lemma 2.9 For all <p,ip £ Inv^\: If ip >UM k x ) ip> where > L(M k {\ * s ^ e L-preorder of M^ \, then 
f >L(i kl ) V'; where >L(i kl ) is the L-preorder of Invk,i- 

The same holds with >l replaced by =l, >r, =r, =d, >j and =j. 



Proof. If ip = cap for some a £ i then let us define a' by a' = a id 



ip ip 1 = a if ip 1 
« idi m (^) <p = Oi(f = ip. 



a id 

□ 



Im(ip) • 



Then we have: 



Im(ip) 



a', hence a' £ Invk,i (since (p,ip £ Invk jL ). Moreover, a' 93 



So far our Lemmas imply that in M^i and in Invi-i, every non-zero element is =d to one of the 
k — 1 elements id/ 0l ^i., f° r i = 1, • • • , A; — 1. Moreover the Lemmas show that if two elements of 
Mfc 5 ! (or of Invh i) are given by tables ipi : Pi — > Qi and 992 : P2 — ► Q2> where Pi, Qi, P2 and Q2 are 
finite prefix codes, then we have: If \Q±\ = IQ2I mod (k — 1) then ipi =d <p>2- 

We still need to prove the converse of this. It is sufficient to prove the converse for Invj-i, by 
Lemma [23] and because every element of M^i is =d to an element of Invk,i (namely idr ai) ... )0j i). 



Lemma 2.10 Let ip, ip € Invp-i. If (p =o ip in Inv^x, then ||<^| 



mod (k — 1). 



Proof. (1) We first prove that if cp =l ip then |domC(<£>)| = |domC( , i/ ; )| mod (k — 1). 

By definition, ip =x ip iff <p = /3 ip and ip = cup for some a,f3 £ Invk,i- By Lemma 11.91 there are 
restrictions (5' and ip' of (3, respectively ip, and an essential restriction $ of cp such that: 

$ = ft o ip', and Dom(/?') = Im(V>')- 
It follows that Dom(<I > ) C Dom(ip'), since if ip'{x) is not defined then $(x) = (3' o ip'{x) is not defined 
either. Similarly, there is an essential restriction \& of ip and a restriction ip' of 9? and such that 
Dom(^) C Bom(ip'). 

Thus, the restriction of both ip and ip to the intersection Dom(<I>) nDom(f ) yields restrictions ip" , 
respectively ip" such that Dom((/j") = Dom(?//'). 
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Claim: p" and ip" are essential restrictions of <p, respectively ip. 

Indeed, every right ideal R of A* that intersects Dom(^) also intersects Dom( 1 J r ) (since Vl/ is an 
essential restriction of ip). Since Dom($) C Dom(<^') C Dom(c^), it follows that R also intersects 
Dom(yj). Moreover, since is an essential restriction of p, R also intersects Dom($). Thus, Dom($) 
is essential in Dom(^). Since Dom(f ) is also essential in Dom(^), it follows that Dom(<I>) n Dom(f) 
is essential in Dom(^); indeed, in general, the intersection of two right ideals R±, R2 that are essential 
in a right ideal R3, is essential in R3 (this is a special case of Lemma ll.8p . This means that ip" is an 
essential restriction of ip. Similarly, one proves that ip" is an essential restriction of ip. [This proves 
the Claim.] 

So, ip" and ip" are essential restrictions such that Dom(<^") = Dom(ip"). Hence, domC^") = 
domC(V'"); Proposition 12,41 then implies that |domC(</?)| = |domC(<//')| = |domG(^")| = |domC(^)| 
mod (k — 1). 

(2) Next, let us prove that if p =r ip then |imC(</?)| = |imC(^)| mod (A; — 1). 

In Invk 1 we have tp =r ip iff p>~ 1 =l ip~ l ■ Also, imC(y) = domC(^ -1 ). Hence, (2) follows from 

(I)- 

The Lemma now follows from (1) and (2), since for elements of Inu^i, |imC(</?)| = |domC(^)| = \\<p\\, 
and since the D-relation is the composite of the L-relation and the i?-relation. □ 

Proof of Theorem [2751 We saw already (in the observations before Lemma [2 . 1 1 and in the preceding 
Lemmas) that for <p\ : P\ — > Q\ and <p>i : Pi — > Q2 (where P±, Qi, P2 and Q2 are non-empty finite 
prefix codes we have: If \Q\\ = \Qy\ mod (k — 1) then ip\ =d ip^. In particular, when \Q\\ = i mod 
(k - 1) then p x = D id {aii .., a } . 

It follows from Lemma 12.101 that the elements idr ai) a a (for i = 1, . . . , k — 1) are all in different 
L>-classes. □ 

So far we have characterized the D- and J-relations of Mk t i and Invk,i- We leave the general study 
of the Green relations of M^i, Inv^i, and the other Thompson-Higman monoids for future work. The 
main result of this paper, to be proved next, is that the Thompson-Higman monoids Mf. \ and Inv^ 1 
are finitely generated and that their word problem over any finite generating set is in P. 

3 Finite generating sets 

We will show that Invk,i and Mj~ 7 i are finitely generated. An application of the latter fact is that a 
finite generating set of M^i can be used to build combinational circuits for finite boolean functions 
that do not have fixed- length inputs or outputs. In engineering, non-fixed length inputs or outputs 
make sense, for example, if the inputs or outputs are handled sequentially, and if the possible input 
strings form a prefix code. 

First we need some more definitions about prefix codes. The prefix tree of a prefix code P C A* 
is, by definition, a tree whose vertex set is the set of all the prefixes of the elements of P, and whose 
edge set is {(x, xa) : a £ A, xa is a prefix of some element of P}. The tree is rooted, with root e (the 
empty word). Thus, the prefix tree of P is a subtree of the tree of A*. The set of leaves of the prefix 
tree of P is P itself. The vertices that are not leaves are called internal vertices. We will say more 
briefly an "internal vertex of P" instead of internal vertex of the prefix tree of P. An internal vertex 
has between 1 and k children; an internal vertex is called saturated iff it has k children. 

One can prove easily that a prefix code P is maximal iff every internal vertex of the prefix tree of 
P is saturated. Hence, every prefix code P can be embedded in a maximal prefix code (which is finite 
when P is finite), obtained by saturating the prefix tree of P. Moreover we have: 
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Lemma 3.1 For any two finite non-maximal prefix codes P\,P 2 C A* there are finite maximal prefix 
codes P{,Pf, C A* such that P x C P[, P 2 C P' 2 , and \P{ \ = \P^\. 



Proof. First we saturate Pi and P 2 to obtain two maximal prefix codes P{' and P'^ such that Pi C P", 
and P 2 C P 2 '. If \P{'\ / \P!{\ (e.g., if \P['\ < |P 2 "|) then \P{'\ and \P!{\ differ by a multiple of k - 1 
(by Prop. [23]). So, in order to make \P{'\ equal to \P 2 \ we repeat the following (until \P{'\ = \P 2 \)- 
consider a leaf of the prefix tree of P" that does not belong to Pi , and attach k children at that leaf; 
now this leaf is no longer a leaf, and the net increase in the number of leaves is k — 1. □ 



Lemma 3.2 Let P and Q be finite prefix codes of A* with \P\ = \Q\. If P and Q are both maximal 
prefix codes, or if both are non-maximal, then there is an element ofGk,i that maps P onto Q. On the 
other hand, if one of P and Q is maximal and the other one is not maximal, then there is no element 
°f Gk,i that maps P onto Q. 



Proof. When P and Q are both maximal then any one-to-one correspondence between P and Q is 
an element of Gk,i- 

When P and Q are both non- maximal, we use Lemma 13. II above to find two maximal prefix codes 
P' and Q' such that P C P' , Q C Q' , and |P'| = \Q'\. Consider now any bijection from P' onto Q' 
that is also a bijection from P onto Q. This is an element of Gki- 

When P is maximal and Q is non-maximal, then every element <p G Mfc : i that maps P onto Q will 
satisfy domC(^) = P; since (p is onto Q, we have imC((/j) = Q. Hence, ip ^ Gk,i since imC(y) is a 
non-maximal prefix code. A similar reasoning shows that no element of Gk,i maps P onto Q if P is 
non-maximal and Q is maximal. □ 

Notation: For u, v € A*, the element of Inv^i with one-element domain code {u} and one-element 
image code {v} is denoted by (u h «). When (u i— > v) is composed with itself j times the resulting 
element of Invk,± is denoted by (u i— ► v) 3 . 



Lemma 3.3 (1) For all j > 0: (ai i— > a\ai) 



(ai 



(2) Let S = {a\ai, a\a 2 , . . . , a\ai}, for some 1 < i < k — 1, < j. Then ids is generated by the k + l 
elements {(fli i-> a x a x ), (aid h-> ai)} U {id {aiaij 0102) aiai} : 1 < i < k - 1}. 

(3) For all j> 2: (e i-> a{)(.) = (ai ^ dai) i_1 ■ (e d)(-)- 



Proof. (1) We prove by induction that (ai i— 
Indeed, (ai i— > aiai) J+1 (.) = (ai i— > aiai) • (ai 



aid)"' = (d | — ► aia|) for all j > 1. 
-> aiaj)(.), and by essential restriction this is 



aia{ 



aiw (w € A j - {a{}) 
aiaiw 



(2) For S = {ajai, a{a 2 , . . . , ajaj} we have 
ids = 

and 



(ai i— > aia{)(.) = (ai i— > aiaiai)(.). 



aiai 
a\a\ 



a\a 2 
a{a 2 



a\ai 
a{ai 



V a i«i 



aia 2 
aia 2 



a\a l 
a\a. 



(■) 



/aid 
Va^ai 



aia 2 
a{a 2 



aiaj 
ai a 



faxax 


aia 2 




aiaj 


d a i+i 




V«i«i 


a{a 2 




a{aj 


a 1 aj+i 





aia fc 



a{a k 



id; 



(ai 



a{)- id 



{aiai, aia2, aia^} 



(ai 



{aiai, aia2 

aiaiV - 1- id 



aiaj}(-) 



{aiai, aia2, aia;}- 
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The map id{ aiai } is redundant as a generator since (aiai i— » a\ai) = (ai«i i— > ai) (ai h- ► aiai)(.). 
(3) By (1) we have (e i— > a{) = (ai i— > a{) • (e i— > ai)(.), and (ai h- > aj) = (ai i— > aiai) J . □ 

Theorem 3.4 TTie inverse monoid Inv^i is finitely generated. 

Proof. Our strategy for finding a finite generating set for Inv^i is as follows: We will use the fact 
that the Thompson-Higman group Gk,i is finitely generated. Hence, if p £ Invi-,1, 9i,92 € Gk,i, and 
if g2'P9i can be expressed as a product p over a fixed finite set of elements of Inv^x, then it follows 
that ip = pgi can also be expressed as a product over a fixed finite set of elements of Invk,i- We 
assume that a finite generating set for Gk,i has been chosen. 

For any element ip € Invk \ with domain code domC(<£>) = P and image code imC(</?) = Q, we 
distinguish four cases, depending on the maximality or non-maximality of P and Q. 

(1) If P and Q are both maximal prefix codes then ip E Gj.i, and we can express <p over a finite fixed 
generating set of Gk,i- 

(2) Assume P and Q are both non-maximal prefix codes. By Lemma [3.11 there are finite maximal prefix 
codes P',Q' such that P C P', Q C Q', and \P'\ = \Q'\; and by Lemma UM \P'\ = \Q'\ = l + (k-l)N 
for some N > 0. Consider the following maximal prefix code C, of cardinality \P'\ = \Q'\ = l+(fc— 1)N: 

C = U?S 2 a[(A-{ ai }) U a 1 ?- 1 A. 

The maximal prefix code C is none other than the code Qij when i = k and j = N — 1 (introduced 
in the proof of Lemma 12.81 Fig- !)• The elements g± : C — > P' and g<i : Q' — > C of can be chosen 
so that ip = g2pgi{.) is a partial identity with domC(V ) ) = imC(V') C C consisting of the \P\ first 
elements of C in the dictionary order. So, tp is the identity map restricted to these \P\ first elements 
of C, and tp is undefined on the rest of C. To describe domC(V ; ) = imC(V') in more detail, let us write 
\P\ = i + (k - 1) £, for some with 1 < i < k and < £ < N - 1. Then 

domC(V) = imC(^) = af" 1 ^ U \J%T? +1 a\{A - {a x }) U oj {a 2 , . . . , a,}. 

where j = N — 1 — £. Since V = iddomC(i/>) > we claim: 

By essential maximal extension 

ip = ids ( a s elements of Invk,i), where S = {a\ai,a\a2, • • • , a{ai}, 

with i,j as in the description of domC(V') = imC(V0 above, i.e., 1 < i < k, N — l>j = N — l — £>0, 
and \P\ = i + (k- 1)1 

Indeed, if \P\ < k then S is just domC(^), with i = \P\, and £ = (hence j = N — 1). If 
|P| > A; then the maximum essential extension of ip will replace the 1 + (k — 1)£ elements a 1 I ~ 1 A U 
U^T/v-j+i a i(^ — { a i}) by the single element a^~ t+l = a\ +1 . What remains is the set 

S = {a{ +1 } U a{ {a 2 , . . . ,aj}. 

Finally, by Lemma [3.31 ids (where 5 = {a^ai, a{a2, . . . , a\ai\) can be generated by the k + 1 
elements {(ai ^ aiai), (01O1 i-> ai)} U {id {aiaij oia2) aia . } : 1 < * < — 1}. 

(3) Assume P is a maximal prefix code and Q is non-maximal. Let Q' be the finite maximal prefix code 
obtained by saturating the prefix tree of Q. Then Q C Q', \Q'\ = l + (k-l)N', and |P| = l + (fe-l)iV 
for some N 1 > N > 0. We consider the maximal prefix codes C and C as defined in the proof of (2), 
using N' for defining C . We can choose g\ : C — > P and 92 '■ Q' ^ C in Gfc,i so that ^ = g2pg\{.) is 
the dictionary-order preserving map that maps C to the first |C| elements of C . So we have 

domC(V>) = C, and 

imC(^) = Sq , where 5q C C consist of the \C\ first elements of C, in dictionary order. 
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Since \C\ = 1 + (k — 1) N, we can describe Sq in more detail by 
S = \£j_ N attA-{ ai }) U a?'- 1 A. 

Next, by essential maximal extension we now obtain ip = (e h- > _Ar ). 

Indeed, we saw that \P\ = l + (k- 1) iV. If |P| = 1 then P = {e}, and ^ = (e i-> af ). If |P| > k 
then maximum essential extension of ip will replace all the elements of C by the single element e, and 
it will replace all the elements of Sq by the single element ~ N . 

Finally, by Lemma 1331 (e i— > _Ar ) is generated by the two elements (ff i— ► ai) and (ai i— ► aiai). 

(4) The case where P is a non-maximal maximal prefix code and Q is maximal can be derived from 
case (3) by taking the inverses of the elements from case (3). □ 

Theorem 3.5 The monoid M^i is finitely generated. 

Proof. Let ip : P — > Q be the table of any element of Mfe i, mapping P onto Q, where P, Q C A* 
are finite prefix codes. The map described by the table is total and surjective, so if |P| = \Q\ (and in 
particular, if tp is the empty map) then <p € Invk,i, hence cp can be expressed over the finite generating 
set of Invk,x- In the rest of the proof we assume |P| > \Q\. The main observation is the following. 

Claim. <p can be written as the composition of finitely many elements (pi S i with tables Pj — > Qi 
such that < |P| - \Qi\ < 1. 

Proof of the Claim: We use induction on \P\ — \Q\. There is nothing to prove when |P| — \Q\ < 1, so 
we assume now that |P| — \Q\ > 2. 

If <p{x\) = p(x 2 ) = ^(^3) = 2/1 for some x\,X2,x% € P (all three being different) and 2/1 € Q, then 
we can write ip as a composition </?(.) = ^2 °^l(-)> as follows. The map ip% : P — ► P — {x±} is defined 
by ipi(xi) = ip\(x2) = X2, and acts as the identity everywhere else on P. The map ip2 : P — {xi} — ► Q 
is defined by ^2(^2) = ^2(^3) = 2/1, and acts in the same way as <p everywhere else on P — {xi}. Then 
for ipi we have |P| — |P — {x±}\ < \P\ — \Q\, and for 1P2 we have |P — {a?i}| — \Q\ < \P\ — \Q\. 

If (p(x{) = ip(x2) = yi and <p{xz) = p{xZ) = 2/2 for some x%, X2, X3, X4 G P (all four being different) 
and 2/1,2/2 € Q (2/1 7^ 2/2), then we can write cp as a composition = ip2 V'i(-), as follows. First the 
map ^1 : P — > P — {x±} is defined by ipi(xi) = ^1(^2) = ^2, and acts as the identity everywhere else 
on P. Second, the map ip2 '■ P — {x\} — ► Q is defined by ^2(^2) = 2/1 and ^2(^3) = ^2(^4) = 2/2, and 
acts like (p everywhere else on P — {xi}. Again, for ipi we have |P| — |P — {xi}| < |P| — |Q| and 
for tp2 we have |P — {xi}| — \Q\ < \P\ — \Q\. [End, proof of the Claim.] 

Because of the Claim we now only need to consider elements <p £ 1 with tables P — > Q such 
that the prefix codes P, Q satisfy |P| = |Q| + 1. We denote P = {p%, . . . ,p n } and Q = {q±, . . . , q n -i}, 
with (/'(Pj) = 9j for 1 < j < n — 1, and ip(p n -i) = <p(pn) = <Zn-l- We define the following prefix code 
C with \C\ = |P|: 

• if |P| = i < k then C = {ai, . . . , ai}; note that i > 2, since |P| > |Q| > 0; 

• if |P| > A; then C = {a 2 ,...,ai}U \J r l\ a\{A - {ax}) U a{A, 

where i,j are such that |P| = i + (k — 2 < i < k, and 1 < j (see Fig. 1). Let us write C in 
increasing dictionary order as C = {c%, . . . , c n }. The last element of C in the dictionary order is thus 

We now write </?(.) = ^3 ip2 V'lCO w here ^1, V2, ^3 are as follows: 

• ipx : P — > C is bijective and is defined by pj 1— > Cj for 1 < j < n; 

• ip2 ■ C — ► C — {a{\ is the identity map on {c\, . . . , c ra _i}, and ip2(cn) = c n -i- 

• vp3 : C — {ai} — ► Q is bijective and is defined by Cj 1— * qj for 1 < j < n — 1. 

It follows that tpi and ips can be expressed over the finite generating set of Inv^i. On the other 
hand, ip2 has a maximum essential extension, as follows. 
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If 2 <\P 

1p2 = 
If |P 

If \P\ = ■ 
max(^2) 



i < k 


then 








a.i-2 




a. 




Oj-2 


Ci-l 





id 



{ai, ai_i} 



a. 



i + (A; — l)j > k and if i > 2 then, after maximal essential extension, ip2 also becomes 

Ot-l 



(k — l)j > k and if i = 2 then, after essential extensions, 

/aid ... a\ak-2 a\ak-i a\au a 2 \ _ fid ai A 
laiai . . . aiafc_2 aiafc_i aiafc aia^ 



0-2 



ai 
ai 



0-2 

aiafc 



In summary, we have factored 9? over a finite set of generators of Inv^i and k additional generators 
in Mfci. □ 



Factorization algorithm: The proofs of Theorems 13.41 and 13.51 are constructive; they provide algo- 
rithms that, given ip G Inv^i or € 1, output a factorization of </> over the finite generating set of 
Invi-,1, respectively M^i. 

In [T7] (p. 49) Higman introduces a four-element generating set for G^i; a special property of these 
generators is that their domain codes and their image codes only contain words of length < 2, and 
that J \"/(x) I — \x\ \ < 1 for every generator 7 and every x € domC(7). The generators in the finite 
generating set of 1 that we introduced above also have those properties. Thus we obtain: 

Corollary 3.6 The monoid M27 has a finite generating set such that all the generators have the 
following property: The domain codes and the image codes only contain words of length < 2, and 
I \ j(x) I — \x\ < 1 for every generator 7 and every x G domC(7). □ 

For reference we list an explicit finite generating set for -M2J. It consists, first, of the Higman generators 
of G? 2 ,i Q17J p. 49): 



Not 



71,2 




1 

00 
00 





01 
10 



(01 <-» 1) 



/oo 


01 


Voo 


1 



01 



(0 +4 10) 



00 
00 



01 

1 



01 



and 



10 
01 



11 
11 



the additional generators for Inv2,i'- 

(e->0), (0->e), (0->00), (00 
the additional generators for M2 1 : 



o); 



and 



01 



00 
00 



01 
01 



01 



Observe that Higman's generators of Gk t i (in [T7] p. 27) have domain and image codes with at most 
3 internal vertices. We observe that the additional generators that we introduced for Inv^i and Mf. 1 
have domain and image codes have at most 2 internal vertices. 



The following problem remains open: Are Invt,\ and M/^i finitely presented? 
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4 The word problem of the Thompson-Higman monoids 

We saw that the Thompson-Higman monoid Mfc i is finitely generated. We want to show now that the 
word problem of \ over any finite generating set can be decided in deterministic polynomial time, 
i.e., belongs to the complexity class P. It follows immediately that all finitely generated submonoids 
of Mfe i have their word problem (over any finite generating set) in P. 

In [3] it was shown that the word problem of the Thompson-Higman group G^i over any finite 
generating set is in P. In fact, it is in the parallel complexity class ACi [3], and it is co-context-free 
|22j . In [1] it was shown that the word problem of the Thompson-Higman group G^^i over the infinite 
generating set T^ i U {Tjj +1 : i > 0} is coNP-complete, where T^i is any finite generating set of G^i; 
the position transposition 73^+1 6 G& 1 has domC(rj j j + i) = imC^j+i) = A t+1 , and is defined by 
ua(5 1— ► uj3a for all letters a, j3 G A and all words u G A 1 " 1 . We will see below that the word problem 
of Mfc 1 over Tjt 1 U {th+x : z > 0} is also coNP-complete, where Y^x is any finite generating set of 

4.1 The image code formula 

Our proof (in [3]) that the word problem of G^i (over any finite generating set) is in P was based on 
the following fact (the table size formula): 

Vip,1>€G ki x: W°<P\\ < WW + IMI- 

Here \\ip\\ denotes the table size of 99, or equivalently, the cardinality of domC(yj). See Proposition 3.5, 
Theorem 3.8, and Proposition 4.2 in [3]. In 1 the above formula does not hold in general, as the 
following example shows. 

Proposition 4.1 For every n > there exists & n = ^ £ ^2,1 (for some 991,992 G M2 : x) wii/i 
i/ie following properties: 

The table sizes are ||$ n || = 2 n , and H992II = ll^ill = 2. So, ||3>n|| * s exponentially larger than 
(n — 1) • 11^2 1| + Hvill- Hence the table size formula does not hold in Mix- 

The word lengths 0/991, 992, and <£ n ('over i/ie finite generating set of Mi t i from the previous Section) 
satisfy \(px\ = 1, \<f2\ < 2, and |$ n | < 2ra. £0 i/ie iafr/e size of <& n is exponentially larger than its 
word length: ||$ n || > 2l*"l/ 2 . 

Proof. Consider 991,992 G -^2,1 given by the tables 991 = {(0 1— > 0), (1 1— > 0)}, and 992 = {(00 1— > 
0), (01 1— > 0)}. One verifies that $ n = 992 _1 o 991 (.) sends every bitstring of length n to the word 
0; its domain code is {0, 1}™, its image code is {0}, and it is its maximum essential extension. Thus, 
||992 -lo ^i|| = 2 n , whereas (n — 1) • \\(pz\\ + WfiW = 2 • n. Also, = (0 1— >• 0, 1 1— > 0) • (0 1— >• s), so 

j^l = l ; \(p 2 \ < 2, and |$> n | < In — 1. □ 

Definition 4.2 T/ie table size 0/ i/te right-ideal homomorphism 9 : PA* — > QA* where P,Q C A* are 
prefix codes, is by definition \\0\\ = \P\. 

The length of the longest word in the table P — > Q of 9 is denoted by 1(0); formally, 1(9) = 
max{|s| : s G domC(6>) U imC(0)}. 

For any finite prefix code Q C A* we also denote the length of the longest word in Q by £(Q) . 

We will use the following facts that are easy to prove. If R C A* is a right ideal and 99 is a 
right-ideal homomorphism then ip(R) and ip~ 1 (R) are right ideals. We also need the following result 
(Lemma 3.3 of [3]): If P,Q,S C A* are such that PA* (1 QA* = SA*, and if S is a prefix code then 
SCPUQ. 
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Before proving Theorem 14.51 that generalizes the table size formula to the monoid case we need 
two Lemmas. 

Lemma 4.3 Assume 9 : PA* — > QA* is a right-ideal homomorphism, and assume SA* C PA* , where 
P,Q,Sc A* are finite prefix codes. Then there is a finite prefix code R C A* such that 9(SA*) = RA* 
and R C 9(S). 

Proof. Since 9 is a right-ideal homomorphism we have 6(SA*) = 9(S) A*. Since 9(S) might not be 
a prefix code we take R = {r E 9(S) : r is minimal (shortest) in the prefix order within 9(S)}. Then 
R is a prefix code that has the required properties. □ 

Lemma 4.4 Let 9 be a right-ideal homomorphism with image Im(8) = QA* such that Q C A* is a 
prefix code. Then 9~ 1 (Q) is a prefix code, and domC(#) = 9~ 1 (Q). 

Proof. First, 9~ l (Q) is a prefix code. Indeed, if we had x\ = X2U for some x\,x 2 E 9~ 1 {Q) with u 
non-empty, then 9{x\) = 9(x2) u, with 9(xi),9(x 2 ) E Q. This would contradict the assumption that 
Q is a prefix code. 

Second, 9~\Q) A* = 9~ 1 {QA*). Indeed, for all 9~ 1 (Q) C 9~ l (QA*), hence 9~ 1 {Q) A* C 
9~ 1 (QA*). Moreover, if y £ QA* then any element of 9~ 1 (y) has the form rw for some r £ 9~ 1 {QA*) 
such that no prefix of r is in 9~ 1 (QA*). Since we just saw that 9~ l (Q) is a prefix code, it follows that 
r £ 9~ 1 (Q); hence, 9~ 1 (QA*) C 9~ 1 (Q) A*. 

Finally, since 9 _1 (Q) A* = 9~ 1 (QA*), and since 9~ 1 (Q) is a prefix code, it follows that domC(0) = 
9-\Q). □ 

The next Theorem is a useful generalization of the "table size formula" of Gk,i to the monoid M^\. 

Theorem 4.5 (Image code formula). Let ipi : Pi A* — > Qi^4* and 932 : P2A* — > Q2^4* be right- 
ideal homomorphisms, where Px,P 2 ,Qi,Q 2 C A* are finite prefix codes. Then 

(1) |imC((/? 2 </?i)| < |imC(</?2)| + |imC(pi)| , 

(2) % 2 o W ) < % 2 )+%x) . 

Proof. (1) The proof is similar to the proof of Proposition 3.5 in [3]. We have Dom(c^2 </?i) = 
V 3 r 1 ( ( 3i^* ^ -^2-^*) an d Im(992 </?i) = <p 2 (Q\A* fl P2A*). So the following maps are total and onto, 
on the indicated sets: 

i P ^ 1 (Q 1 A*nP 2 A*) ^ Q 1 A*nP 2 A*^ MQiA* nP 2 A*). 

By Lemma 3.3 of [3] (quoted above) we have Q\A* n P 2 A* = SA* for some finite prefix code S with 
S C Qi U i-*2- Moreover, by Lemma 14.31 we have (^(S^*) = P^2^4* for some finite prefix code P2 such 
that R 2 C (/^(S 1 )- Now, since S C Qi U P 2 we have P 2 C (/^(S 1 ) C (f 2 (Qi) U ^2(^2) = ^(Ql) U (?2- 
Thus, |imC(^ 2 ° </>i)l = \R 2 \ < \MQi)\ + IQ2I < + |Q 2 |. 

(2) We want to bound the length of words in imC(<p 2 o ipi) and in domC(<^2 </?i)- 

(2. a) Let us first look at imC(</?2 </?i)- We saw above that imC(i^2 </>l) = R2 Q </>2(Qi) U Q 2 . The 

longest words in Q 2 are of length < l((p 2 ) ( < ^(^2) + ^(^l))- 

On the other hand, for a longest word y in ip 2 {Qi) we have the following: y = (p 2 {qi) for some 
Qi £ Qif^P 2 A* (we have q\ £ P2-A* since ip 2 is defined on gi). Thus, q\ = p 2 w for some p 2 € P 2 ,w € A*, 
hence < |<?i|. Now y = (f 2 (p 2 w) = (f 2 {p 2 ) w, hence \y\ = \<p 2 (p 2 )\ + \w\ < ^(<^ 2 ) + |gi| < £(f 2 )+£((pi). 
(2.b) Let us now look at domC(y?2 ¥>i)- We saw above that Dom((p 2 o = ip^ 1 (SA*), where 
S £ Qi U P 2 . By Lemma \4A\ domC(i^2 fl) = fi 1 (S). Hence, domC(</?2 yi) = ( P\ 1 (S) Q 
(f^iQi) U ^{Pi) C P x U ^r X (P2). Let us consider x £ Pi U ^(Pj). 
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For x e P\ we obviously have |x| < i(^fi) ( < ^(^2) + 

On the other hand, consider a longest word x in (^^(i^)- We have x E Pi A* since </>i is defined 
everywhere on y?^ 1 ^). Therefore, x = p\W for some pi G Pi, w G A*. So, (^i(x) = <£>i(pi) hence 
\w\ < |</3i(a;)|; and since (fi(x) € P2 we have |<^i(x)| < ^(922); so, \w\ < ^(^2)- Thus, |x| = |pi| + |w| < 
%i)+% 2 ). □ 

For elements of Invk,i the image code has the same size as the domain code, which is also the table 
size. Thus Theorem 14.51 implies: 

Corollary 4.6 For all tp,ifi 6 Invk,i-' o (p\\ < \\ip\\ -\- \\ip\\. □ 

In other words, the table size formula holds for Inv^^i- Another immediate consequence of Theorem 
14.51 is the following. 

Corollary 4.7 Let ipi : Pi A* — > QiA* be right-ideal homomorphisms for i = 1, . . . , n, where Pi,Qi C 
A* are finite prefix codes. Let c\,C2 be positive constants. 

(1) If \Qi\ < ci for all i then |imC((/? n o . . . o ipi)\ < c% n. 

(2) If £((fi) < C2 for all i then £((p n o . . . o ipi)\ < C2n. 

So, if \Qi\ < ci and l((fi) < C2 /or all i then imC(<^ n o . . . o ipi) consists of a linearly bounded number 
(< c\n) of words, each of linearly bounded length (<C2n). □ 

The position transposition nj (with < i < j) is, by definition, the partial permutation of A* 
which transposes the letters positions i and j; T^j is undefined on words of length < j. More precisely, 
we have domC(rjj) = imC(Tjj) = A 3 , and uav(3 1— > u(3va for all letters a, (5 € A and all words 
u £ A 1 ^ 1 and v € A^ 1 " 1 . In this form, r» »• is equal to its maximum essential extension. 

Corollary 4.8 The word-length of Tij over any finite generating set of M^i is exponential. 

Proof. We have |imC(rjj)| = h? . The Corollary follows then from Corollary 14.7( 1). □ 

4.2 Some algorithmic problems about right-ideal homomorphisms 

We consider several problems about right-ideal homomorphisms of A* and show that they have de- 
terministic polynomial-time algorithms. We also show that the word problem of Mk t i over r^iU 
{r^i+i : < i} is coNP-complete, where T^i is any finite generating set of M^i. This will help us with 
the complexity analysis of the word problem of the Thompson-Higman monoids M^i, and provide 
other corollaries of independent interest. 

Lemma 4.9 There are deterministic polynomial time algorithms for the following problems. 
Input: Two finite prefix codes Pi, P2 C A* , given explicitly by lists of words; 

Output 1: The finite prefix code H C A* such that UA* = P\A* n P2A* , where II is produced explicitly 
as a list of words. 

Question 2: Is P\A* n P2^4* essential in P\A* (or in P2A* , or in both)? 

Proof. We saw already that II exists and II C Pi U P2 (see Lemma A.l and Lemma 3.3 of [3]). 
Algorithm for Output 1: Since we know that II C Pi U P2, we just need to search for the elements of 
II within Pi U P2. For each x G Pi we check whether x also belongs to P 2 ^4* (by checking whether 
any element of P2 is a prefix of x). Since Pi and P2 are explicitly given as lists, this takes polynomial 
time. Similarly, for each x G P2 we check whether x also belongs to Pij4*. Thus, we have computed 
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the set IIi = (-Pi Hi^^*) U (P2 C\P\A*). Now, II is obtained from IIi by eliminating every word that 
has another word of IIi as a prefix. Since IIi is explicitly listed, this takes just polynomial time. 

Algorithm for Question 2: We first compute IT by the previous algorithm. Next, we check whether 
every p\ € Pi and every P2 € P2 is a prefix of some r £ II; since Pi,P%, and IT are explicitly listed, 
this takes just polynomial time. □ 

Notation: We denote the unique prefix code that generates a right ideal R C A* by prefC(-R). We 
observe that if cp\ : P\A* — > Q\A* and if2 '■ P2A* — > Q2A* are right-ideal homomorphisms, where 
Pi, Qi, P2, Q2 C A* are finite prefix codes, then imC(</?2 ¥>i(-)) = P re fC(</22(Qi^4*))- 

Lemma 4.10 The following input-output problem is in P. 

• Input: A finite prefix code So C A* (given explicitly by a list of words), and n right-ideal homo- 
morphisms (fi : P{A* — ► QiA* for i = 1, . . . , n (given explicitly by finite tables); Pi, Qi C A* are finite 
prefix codes. 

• Output: The finite prefix code prefC(99 n o . . . o ipi(SoA*)) , given explicitly by a list of words. 

Proof. We first prove the Lemma in case n = 1. We note that prefC(</?i(Sb^*)) = ^1 (II), where 
IT is the prefix code that generates the right ideal So A* n PA*. By Lemma I4.9|, II is finite and 
can be explicitly found in deterministic polynomial time. From IT, an explicit list for ^i(IT) can 
be obtained in time polynomial in |IT|, £(H), ||<£>i|| and £(tpi). By Theorem 14.51 applied to ip± and 
\d So we have: |prefC(</>i(S 4*))| = |imC(pi o id So )| < |imC(^i)| + |5 |, and %i(S A*)) = 
£(ipi o \ds ) < £{<pi) + £{Sq). So the total time for computing 991 (II) is polynomial in terms of the 
size of the input. Let pi(||9?i ||, £(<pi), \So\,£(Sq)) be a polynomial upper bound for finding (pi(H) from 
the input. 

To prove the Lemma in general, we compute the sequence of finite prefix codes S\, S2, ■ ■ ■ , S n , 
where Si = prefC(ifi(Si-iA*)) for i = 1, . . . , n. For this we repeatedly use the case n = 1 above, thus 
computing Si from Sj+i and ifi in deterministic time < p^(||</3j||,£(y3j), \Si-i\,£(Si-\)), where p[ is a 
polynomial. By Theorem 14.51 applied to ifi-i, ■ ■ ■ , (fi, \ds we have: 

ISi-ll = \\mC(<pi- 1 o...O(p 1 (S Q A*))\ < \S \ + Er=i l' mC (yr)| , 

£{St-i) = £(prefC(<p i - 1 o...o n (S A*))) < £(S ) + Et=i%i) • 
So, and £{Si-\) are linearly bounded in terms of the input size, hence 

MiPi\\^i)ASi-i\,t{Si-i)) < p l {\Wi\\,£(^i),\So\,£{So)) , 
where pi is a polynomial. Finally, the total time is at most El-iKOI^illi^'/ 9 *)) \So\,£{So)), which is 
a polynomial in terms of the input size. □ 

Corollary 4.11 The following input-output problem has a deterministic polynomial-time algorithm. 

• Input: Right-ideal homomorphisms (fj : PjA* — > QjA* (for j = l,...,n), where Pj,Qj C A* are 
finite prefix codes; each ipj is explicitly given by its table. 

• Output: The set imC(<£ n o . . . o <px), given explicitly by a list of words. 

Proof. This is a special case of Lemma 14. 101 with Sq = {e}. □ 

When we consider the word problem of M^i over a finite generating set, we measure the input 
size by the length of input word (with each generator having length 1). But for the word problem 
of Mk,i over the infinite generating set T^i U {rj-i^ : i > 1} we count the length of the position 
transpositions Tj-i^ as i, in the definition of the input size of the word problem. Indeed, at least log 2 % 
bits are needed to describe the subscript i of Tj_i j. Moreover, in the connection between M^i (over 

1 U { T i-i,i '■ i > 1}) an d circuits Ti—n is interpreted as the wire-crossing operation of wire number 
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i and wire number i — 1; this suggests that viewing the size of r$_i $ as i is more natural. In any case, 
we will see next that the word problem of i over T^i U {t^h : i > 1} is coNP-complete, even if 
the size of Tj_i j is more generously measured as i; this is a stronger result than if log 2 i were used. 

Theorem 4.12 (coNP-complete word problem). The word problem of M^i over the infinite 
generating set T^i U {rj_i j : i > 1} is coNP -complete, where T^i is any finite generating set of x . 

Proof. In [1] (see also [2]) it was shown that the word problem of the Thompson-Higman group G^i 
over To,, j U {Tj_x,j : i > 1} is coNP-complete, where r^,. 1 is any finite generating set of Gk : i- Hence, 
since the elements of the finite set Ta k 1 can be expressed by a finite set of words over I\i, it follows 
that the word problem of i over T^i U {t,j_i $ : i > 1} is also coNP-hard. 

We will prove now that the word problem of M^i over Tki U {ji—n : i > 1} belongs to coNP. The 
input of the problem consists of two words (p m , ■ ■ ■ , pi) and (a n , . . . , <7i) over r^i U {rj-i^ : i > 1}. 
The input size is the length X^fc=i \Ph\ + Sj=i where each generator in r^i has length 1, and 
each generator of the form Tj_i j has length i. 

Since Tk s i is finite there is a constant c > such that c > £(j) for all 7 6 r& 1; also, for each 
Tj-i^ that occurs in {p m , . . . , pi} U {<7 n , . . . , a\} we have i(ri-\ t i) = i. By Theorem 14.51 (2), the table 
of a n o . . . o o\ (and more generally, the table of Uj o . . . o o\ for any j with n > j > 1) only contains 
words of length < YZj=i £( a j)i an d similarly for p m o . . . o pi (and for pi o . . . o p%, m > i > 1). So all 
the words in the tables for any cr,- o . . . o o\ and any pi o . . . o p\ have lengths that are linearly bounded 
by the size of the input ((p m , . . . ,pi), (a n , . . . ,01)). 

Claim. Let N = max{^]™ x £(pi), Yll=i ^( a j)}- Then p m ■ . . . ■ p\ ^ a n ■ . . . ■ o\ as elements of Mk,\ 
iff there exists x G A N such that p m o . . . o p±(x) 7^ a n o . . . o o"i(x). 

Proof of the Claim: As we saw above, the tables of p m o . . . o p\ and a n o . . .0 a\ only contain words of 
length < N. Thus, restricting p m o . . . o p\ and o~ n o . . . o o~\ to A N A* is an essential restriction, and 
the resulting tables have domain codes in ^4^. Therefore, p m ■ . . . ■ p\ and o~ n ■ . . . ■ o~\ are equal (as 
elements of Mf, 1) iff p m o . . . o pi and a n o . . . o o~\ are equal on A . [End, Proof of Claim] 

Based on the Claim, we obtain a nondeterministic polynomial-time algorithm which decides (non- 
deterministically) whether there exists x £ A N such that p m o . . . o p\{x) 7^ tr n o . . . o o~\(x), as 
follows: 

The algorithm guesses x G computes p m o . . . o p\{x) and <r n o . . . o <ti(x), and checks that 
they are different words (€ A*) or that one is undefined and the other is a word. Applying Theorem 
14.51 (2) to p m o . . . o pi o id^iv and to a n o . . . o 01 o id^jv shows that \p m o . . . o pi(x)\ < 2N and 
\o~ n o . . . o o"i(x)| < 2A^. Also by Theorem 14.51 (2), all intermediate results (as we successively apply 
Pi for i = 1, . . . , m, or <jj for j = 1, . . . ,n) are words of length < 2A^. These successive words are 
computed by applying the table of pi or Oj (when pi or Oj belong to T^ i), or by directly applying the 
position permutation T^h-l (if Pi or Uj is Th,h-l)- Thus, the output p m o . . . o pi(x) (and similarly for 
o~ n o . . . o <7i(x)) can be computed in polynomial time. 

The above is a nondeterministic polynomial-time algorithm for the negated word problem. Hence 
the word problem of M^i over T^i U {Tj-i^ : i > 1} is in coNP. □ 

4.3 The word problem of Mk,i is in P 

We now move ahead with the the proof of our main result. 

Theorem 4.13 (Word problem in P). The word problem of the Thompson-Higman monoids 1, 
over any finite generating set, can be decided in deterministic polynomial time. 
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We assume that a fixed finite generating set T^i of Mfc ; i has been chosen. The input consists of 
two sequences (p m , . . . , pi) and (a n , . . . , a\) over r^i, and the input size is m + n. We want to decide 
in deterministic polynomial time whether, as elements of M^i, the products p m ■ . . . ■ p\ and o~ n ■ . . . -o\ 
are equal. 

Overview of the proof: 

• We compute the finite sets imC(p m o ... o p^, imC(cr n o . . . o o\) C A*, explicitly described by 
lists of words. By Corollary 14.111 we can do this in polynomial time, and these sets have polynomial 
size. (Note however that by Proposition 14.1^ the table sizes of p m o . . . o p\ or a n o . . . o o\ could be 
exponential in m or n.) 

• We check whether Im(p m o ... opi) n Im(c n o . . . o a±) is essential in Im(p m o . . . o p{j and in 
Im(cr n o . . . o <7i). By Lemma 14.91 (Question 2) this can be done in polynomial time. If the answer is 
"no" then p m ■ . . . ■ p\ ^ o~ n ■ . . . ■ <?\, since they don't have a common maximum essential extension. 
If "yes" , we continue. 

• We compute the finite prefix code LT C A* such that 11^4* = Im(p m o . . .o p\) n Im(a n o ... o a\). 
By Lemma 14.91 (Output 1) this can be done in polynomial time, and LT has polynomial size. 

• For every r € LT we compute a deterministic finite automaton (DFA) accepting the finite set 
(p m o . . . o pi) _1 (r) C A*, and a DFA accepting the finite set (a n o . . . o cr 1 ) _1 (r) C A*. By Corollary 
14.151 this can be done in polynomial time, and the DFAs have polynomial size. (Note that the finite 
sets themselves could have exponential size in m or n.) 

• For every r G LT we check whether the DFAs for (p m o . . . o p 1 ) _1 (r), respectively (a n o . . . o ( ri)~ 1 (r), 
are equivalent. By classical automata theory, this can be done in polynomial time. 

These DFAs are equivalent for all v G LI iff (pm 

o...opx) 1 (r) = (a n o . . . octi) 1 (r) for all r € LT. 
Since 11^4* is essential in Im(p m o. . . opi) and in Im(cr n o. . . oo"i), this holds iff p m ■ . . . ■ p\ = u n ■ . . . -a\ 
(in Mk t i). [End of Overview.] 

Automata — notation and facts: In the following, DFA stands for deterministic finite automaton. 
The language accepted by a DFA A is denoted by C(A). A DFA is a structure (5, A, 5, so, F) where 
S is the set of states, A is the input alphabet, so £ S is the start state, F C S is the set of accept 
states, and 5 : S x A — > S is the next-state function; in general, 5 is a partial function. We extend 
the definition of 6 to a function S x A* — > S by defining 5(s, w) to be the state that the DFA reaches 
from s after reading w (for any w € A* and s G S) . See |18} [2T] for background on finite automata. A 
DFA is called acyclic iff its underlying directed graph has no directed cycle. It is easy to prove that 
a language L C A* is finite iff L is accepted by an acyclic DFA. Moreover, L is a finite prefix code 
iff L is accepted by an acyclic DFA that has a single accept state (take the prefix tree of the prefix 
code, with the leaves as accept states, then glue all the leaves together into a single accept state). 
By the size of a DFA A we mean the number of states, IS"!, of the DFA; we denote this by size(^l). 
By the min- depth of a DFA A with single accept state we mean the length of the shortest path from 
the start state to the accept state; we denote this by mindepth(^l). (We use the term "min-depth" to 
avoid confusion with the usual concept of "depth" of an acyclic graph, which refers to the length of 
the longest path from a source vertex to a sink vertex.) For a finite prefix code P C A* we denote the 
length of the longest word in P by £(P), and we define the total length of P by ||P|| = J2 x eP \ x \- 

For a language L C A* and a partial function : A* — > A* , we define the inverse image of L under 
$ by = {x G A* : $(x) G L}. 

For L C A* we denote the set of all strict prefixes of the words in L by spref(L). 

The reason why we use acyclic DFAs to describe finite sets is that a finite set can be exponentially 
larger than the number of states of a DFA that accepts it; e.g., A n is accepted by an acyclic DFA 
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with n + 1 states. This conciseness plays a crucial role in our polynomial-time algorithm for the word 
problem of M^i- 

Lemma 4.14 Let A be an acyclic DFA with a single accept state. Let <p : PA* — > QA* be a right- 
ideal homomorphism, where P, Q C A* are finite prefix codes. We assume that £(Q) < mindepth(„4), 
and that ip- l {C{A)) ^ 0. 

Then <p~ l (C(A)) is accepted by a one- accept- state acyclic DFA whose size is < size(^4) + ||-P||, 
and whose rain-depth is > mindepth(^l) — £(Q). Moreover, the transition table of this DFA can be 
constructed deterministically in polynomial time, based on the transition table of A and the table of (p. 

Proof. Let A = (S, A, 5, sq, {sa}) where sa is the single accept state; sa has no out-going edges (they 
would be useless). For any set X C A* and any state s G S we denote {5(s,x) : x G X} by S(s,X). 
Recall that spref(Q) denotes the set of all strict prefixes of the words in Q. Since A is acyclic, its 
state set S can be partioned into the following two sets: S(so, spref (Q)), and 5(so,QA*). The block 
(J(sO)Spref(Q)) is non-empty since it contains so; the block S(so,QA*) is non-empty because of the 
assumption ip~ l (C(A)) ^ 0. 

Since C(A) is a prefix code and ip is a right-ideal homomorphism, <p~ l (C{A)) is a prefix code. To 
accept <^ _1 (£(.4)) we introduce an acyclic DFA with single accept state, called ip~ 1 (A), constructed 
as follows: 

• State set of <p>~ l {A): spref (P) U 6{s , QA*). 

The start state is e, i.e., the root of the prefix tree of P. 
The accept state is the accept state sa of A. 

• State-transition function 5± of Lp^ 1 {A): 

For every r G spref (P) and a G A such that ra G spref (P): 5\(r,a) = ra. 
For every r G spref (P) and a G A such that ra G P: 5\(r, a) = S(sq, ip(ra)). 
For every s G 5(sq, QA*): 5\(s,a) = S(s,a). 

It follows immediately from this definition we have for all p G P: 5\(s,p) = S(so, f(p))- 

The DFA p^ 1 (A) can be pictured as being constructed as follows: The DFA has two parts. The first 
part is the prefix tree of P, but with the leaves left out (and with the leaf edges dangling). The second 
part is the DFA A restricted to the state subset S(so, QA*). The two parts are connected together by 
gluing each (hypothetical) leaf p G P to the state <5(soj ¥>(p)) £ 5(so, QA*). 

The description of ip~ x {A) constitutes a deterministic polynomial time algorithm for constructing the 
transition table of ip^{A), based on the transition table of A and on the table of tp. 

We will prove now that the DFA ^^(A) accepts exactly (p' 1 (C(A)); i.e., (p~ 1 (C(A)) = £(tp~ l (A)). 

[C] Consider any y G C(A) such that tp~ l (y) ^ 0. We want to show that cp~ 1 (A) accepts all the 
words in ip^ 1 (y). Since (^ _1 (y) /0we have y = qw for some strings q G Q = imC(<£>) and w G A*. 
Since Q is a prefix code, q and w are uniquely determined by y. Moreover, since y G C(A) it follows 
that y = qw has an accepting path in A of the form 

so S(s ,q) s A - 

Then for every x G ^^{y) we have x = pv for some strings p G P and v G A*, so ip(x) = ip(p) v; we 
also have <p(x) = qw, hence (p(p) and q are prefix-comparable. Therefore, <p(p) = q, since Q is a prefix 
code, and hence v = w. Thus every x G (p~ 1 (qw) has the form pw for some string p G <p>~ l {q). Now in 
ip^i^A) there is the following accepting path on input x = pw G (p~ 1 (qw) = (p~ l {q) w: 

e h{e,P) = S(s ,tp(p)) -^U s A - 
Thus ip -1 (A) accepts x = pw. 
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[5] Suppose <p~ l (A) accepts x. Then, because of the prefix tree of P at the beginning of if" 1 (A), x 
must have the form x = pw for some strings p £ P and w £ A*. The accepting path in ip~ 1 (A) on 
input pw has the form 

so <5i(e,p) = <5(s Q) ^(p)) s A - 

Also, ^(x) = qw where q = <p(p) £ Q. Hence .A has the following path on input qw: 

s 5(s ,?) = £(so,¥>(p)) 

So, £ £(.4). Hence, s £ ip~ x {qw) C yr 1 ^.*)). Thus C ^(jC^)). □ 

Corollary 4.15 Let *4 6e an acyclic DFA with a single accept state. For % = 1, . . . , n, let Pi,Qi C A* 
be finite prefix codes, and let ipi : PiA* — > QiA* be a right-ideal homomorphism. We assume that 
Yn=iKQi) ^ mindepth(^l), and that (<p n o . . . o (p^- 1 (£(A)) + 0, 

Then [<p n o ... o ipi)~ 1 (C(A)) is accepted by a one- accept- state acyclic DFA whose size is < 
size(.4) + J27=i \\Pih an ^ whose min-depth is > mindepth(„4) — Y^i=x^{Qi)- 

Moreover, the transition table of this DFA can be constructed deterministically in polynomial time, 
based on the transition table of A and the tables of ipi (i = 1, . . . , n). 

Proof. We use induction on n. For n = 1 the Corollary is just Lemma 14.141 

Let n > 0, assume the Corollary holds for n homomorphisms, and consider one more right-ideal 

homomorphism cpo : PoA* — > QqA* , where Po, Qo C A* are finite prefix codes. Assume Y^i=oKQi) — 

mindepth(^l), and assume (<p n o . . . o y^o ipo)~ 1 (C(A)) ^ 0. 

Since (ip n o . . . o ipio tpo)" 1 (C(A)) = f^ 1 o (ip n o . . . o <pi)~ 1 (C(A)), let us apply Lemma T4.14I to 

(fo and the DFA (ip n o . . . o ipi)^ 1 ^). The hypothesis that £(Qo) is at most equal to the min-depth 

of this DFA holds; indeed, E" =0 ^(<2i) ^ mindepth(^l) implies 1{Q ) < mindepth(^) - Y!i=iKQi) < 

mindepth(((/3 n o . . . o<p x o ( / 9 )~ 1 (»4)). 

The conclusion of Lemma [4. 141 is then that (ip n o . . . oipio ipQ)~ 1 (C(A)) is accepted by a DFA whose 

size is < size(((/9 n o . . . o ^i)" 1 ^)) + £(P ) < s\ze(A) + Ya=i \\ p i\\ + K p o)- And the min-depth of 

this DFA is > mindepth((<p n o . . . o ip^iA)) - i(Q ) > mindepth(.A) - J2i=i £ (Qi) ~ KQo)- u 

Proof of Theorem l4?L3t 

Let (p m , . . . , pi) and (a n , . . . , a\) be two sequences of generators from the finite generating set r^i. 
We want to decide in deterministic polynomial time whether the products p m ■ . . . ■ p\ and a n ■ . . . ■ a\ 
are the same (as elements of Mk,i)- 

First, by Corollary 14.111 we can compute the sets imC(p m o . . . o p{) and imC(<r n o . . . o <ti), 
explicitly described by lists of words, in polynomial time. By Lemma 14.91 we can check in polynomial 
time whether the right ideal Im(p m o . . .o p\) n Im(cr„ o . . . o ax) is essential in Im(p m o . . .o p\) and in 
Im(cr n o . . . o o~i). If not, we immediately conclude that p m ■ . . . ■ p% ^ o~ n ■ . . . ■ o~\. Otherwise, Lemma 
14.91 lets us compute a generating set n for the right ideal Im(p m o . . . o p{) n Im(cr n o . . . o ai), in 
deterministic polynomial time; this generating set n will be a finite prefix code, given explicitly by a 
list of words. By Corollary 14. 71 and because n C imC(/3 m o . . . o p\) U imC(cr„ o . . . o a±), H has linearly 
bounded cardinality and the length of the longest words in n is linearly bounded. 

To find out whether p m ■ . . . ■ p\ = a n ■ . . . - ax, it is sufficient to check whether (p m o . . . op 1 ) _1 (r) = 
(er n o. . . o(7]) _1 (r) for every r £ n, since HA* is essential in both Im(p m o . . . op 1 ) and Im(<7„, o...o<7j). 

Let A = maxl^j £(imC(pi)), Y?j=i ^(irnC(a"j))}. For every r £ n we have (p m o. . .op 1 ) _1 (r) ^ 
and (a n o . . . o cri) _1 (r) ^ 0, because n C Im(p m o . . . o p{) n Im(cr n o . . . o a±). 

If |r| > A then Corollary 14. 151 implies that (p m o . . . o is accepted by an acyclic one-accept- 
state DFA A p , which can be constructed deterministically in polynomial time; similarly, we construct 
an acyclic one-accept-state DFA A a which accepts (a ri o . . . o ai)~ l {r). 
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If |r| < A, we replace r by rA^' r \. It is easy to see that r j4 A_ ' r is accepted by an acyclic 
single- accept-state DFA with A + 1 states. By Corollary 14.151 (p m o . . . o p 1 ) _1 (r A x ~\ r \) is accepted by 
an acyclic one-accept-state DFA A p , which can be constructed deterministically in polynomial time. 
Similarly, we construct an acyclic one-accept-state DFA A a which accepts (a n o , .. o <7i)~ l (r A A_ l r l). 

Obviously, (p m o . . . o / o 1 ) _1 (r yl A_ l' r l) = (a n o . . . o <7i) _1 (r ^4 A ~l r l) (or, in case |r| > A, (p m o . . . o 
(r) = (<r n o . . . o <7i) _1 (r)) if and only if A p and A a accept the same language, i.e., they are 
equivalent DFAs. It is well known (see e.g., [18], or [21] pp. 103-104) that the equivalence problem for 
DFAs that are given explicitly by transition tables, is decidable deterministically in polynomial time. 
This proves Theorem 14.131 □ 
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